BIOTECHNOLOGY #3 gp--^ 




RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:



The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Date Processed by STIC: 



Application Serial Number: Q 



Source: 




Checker Version 3.0 



Checker Version 3.0 can be downloaded from the USPTO website at the following address:



http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE



1  Wrapped Nucleics / Wrapped Aminos
   The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
   was retrieved in a word processor after creating it. Please adjust your right margin to .3, this will
   prevent "wrapping."

2  Invalid Line Length
   The rules require that a line not exceed 72 characters in length. This includes white spaces.

3 ^  Misaligned Amino Numbering
   The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers,
   use space characters, instead.

4  Non-ASCII
   The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
   ensure your subsequent submission is saved in ASCII text.

5  Variable Length
   Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules,
   each n or Xaa can only represent a single residue. Please present the maximum number of each
   residue having variable length and indicate in the <220>-<223> section that some may be missing.

6  PatentIn 2.0 "bug"
   A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
   sequence(s). Normally, PatentIn would automatically generate this section from the
   previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
   the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
   Artificial or Unknown sequences.
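The line-level rules in this part of the summary (at most 72 characters per line including spaces, no tab codes, ASCII text only) can be screened before submission. The following is a minimal Python sketch, not the Checker program itself; the function name check_lines and the wording of the messages are illustrative assumptions.

```python
from typing import List, Tuple

MAX_LINE_LENGTH = 72  # limit quoted in the error summary; spaces count


def check_lines(text: str) -> List[Tuple[int, str]]:
    """Return (line_number, problem) pairs for lines that break the basic rules."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, f"line is {len(line)} characters long"))
        if "\t" in line:
            problems.append((number, "tab character found; use spaces"))
        if not line.isascii():
            problems.append((number, "non-ASCII character found"))
    return problems


# Hypothetical sample: one good line, one long line, one tab, one non-ASCII.
sample = "ATCGGGATAG TACTGGTCAA 20\n" + "A" * 73 + "\nMet\tThr\nGl\u00fc\n"
for number, problem in check_lines(sample):
    print(number, problem)
```

A file that passes this screen can still fail other content rules; the sketch covers only the three line-level checks named above.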



7  Skipped Sequences (OLD RULES)
   Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

   (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   This sequence is intentionally skipped

   Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.



8  Skipped Sequences (NEW RULES)
   Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence.

   <210> sequence id number
   <400> sequence id number
   000
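The new-rules placeholder for an intentionally skipped sequence is three lines: <210> and <400> with the sequence id number, then 000. A short Python sketch of producing those lines for every gap in a listing follows; the helper names (missing_ids, skipped_stub, stubs_for_gaps) are hypothetical and the exact spacing after each numeric identifier is an assumption.

```python
from typing import Iterable, List


def missing_ids(present: Iterable[int], total: int) -> List[int]:
    """Sequence id numbers from 1..total that do not appear in the listing."""
    seen = set(present)
    return [i for i in range(1, total + 1) if i not in seen]


def skipped_stub(seq_id: int) -> str:
    """Three-line placeholder for one intentionally skipped sequence."""
    return f"<210> {seq_id}\n<400> {seq_id}\n000\n"


def stubs_for_gaps(present: Iterable[int], total: int) -> str:
    """Placeholders for every skipped id, separated by blank lines."""
    return "\n".join(skipped_stub(i) for i in missing_ids(present, total))


# Hypothetical listing that declares 5 sequences but contains only 1, 2 and 4.
print(stubs_for_gaps([1, 2, 4], total=5))
```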

9  Use of n's or Xaa's (NEW RULES)
   Use of n's and/or Xaa's have been detected in the Sequence Listing.
   Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
   In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10  Invalid <213> Response
   Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
   scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
   is Artificial Sequence.

11 ^  Use of <220>
   Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses.
   Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
   "Unknown." Please explain source of genetic material in <220> to <223> section.
   (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12  PatentIn 2.0 "bug"
   Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
   resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
   listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.
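The two <220>-<223> rules stated in this part of the summary (a feature section is mandatory when n or Xaa is present, and when the <213> response is "Artificial Sequence" or "Unknown") lend themselves to a mechanical pre-check. The Python sketch below is an illustration only: it assumes a new-rules listing in which each numeric identifier starts its own line and nucleotides are written in lower case, and every function name in it is hypothetical.

```python
import re
from typing import Dict, List

TAG_LINE = re.compile(r"^<(\d{3})>\s*(.*)$")
NEEDS_FEATURE = {"artificial sequence", "unknown"}
Record = Dict[str, List[str]]


def split_records(listing: str) -> List[Record]:
    """Group lines by sequence; each <210> line starts a new record."""
    records: List[Record] = []
    current_tag = None
    for raw in listing.splitlines():
        line = raw.strip()
        match = TAG_LINE.match(line)
        if match:
            current_tag, value = match.groups()
            if current_tag == "210":
                records.append({})
            if records:
                records[-1].setdefault(current_tag, []).append(value)
        elif line and records and current_tag is not None:
            # Continuation line (for example sequence data after <400>).
            records[-1][current_tag].append(line)
    return records


def has_variable_residue(sequence_lines: List[str]) -> bool:
    """True if any token is Xaa or a lower-case nucleotide group containing n."""
    for line in sequence_lines:
        for token in line.split():
            if token == "Xaa":
                return True
            if token.isalpha() and token.islower() and "n" in token:
                return True
    return False


def check_listing(listing: str) -> List[str]:
    problems = []
    for record in split_records(listing):
        seq_id = record["210"][0]
        organism = " ".join(record.get("213", [])).strip().lower()
        has_feature = "220" in record
        if organism in NEEDS_FEATURE and not has_feature:
            problems.append(f"SEQ ID {seq_id}: <213> is {organism!r} but no <220> section")
        if has_variable_residue(record.get("400", [])[1:]) and not has_feature:
            problems.append(f"SEQ ID {seq_id}: n or Xaa present but no <220> section")
    return problems


# Hypothetical two-sequence listing; only sequence 1 lacks its <220> section.
SAMPLE = """
<210> 1
<211> 12
<212> DNA
<213> Artificial Sequence
<400> 1
atgcnnatgc ca 12
<210> 2
<211> 4
<212> PRT
<213> Homo sapiens
<220>
<223> Xaa at position 2 is Ala or Gly
<400> 2
Met Xaa Asn Leu
"""
for problem in check_listing(SAMPLE):
    print(problem)
```

The sketch does not judge whether a <213> response is a valid scientific name; that still needs a human reader.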



AMC - Biotechnology Systems Branch -06/04/2001 



GENERAL INFORMATION:
(i) APPLICANT: PEREGRINO FERREIRA, Paulo;
    GESSIEN KROON, Erna;
    PIMENTA DOS REIS, Karlisson Jennner;
    BIAS FORTES FERRAZ, Isabella;
    CERQUEIRA LEITE, Romulo.
(ii) TITLE OF INVENTION: Method and composition for the diagnosis of equine
infectious anemia virus disease by using the recombinant capsid protein virus
(p26)
(iii) NUMBER OF SEQUENCES: 1
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Universidade Federal de Minas Gerais - CTIT
(B) STREET: Avenida Antonio Carlos, 6627 Bairro Sao Francisco
(C) CITY: Belo Horizonte
(D) STATE: Minas Gerais
(3) Computer: Apple Macintosh;

(i) Operating System: Macintosh;

(ii) Macintosh File Type: text with line
termination;

(iii) Line Terminator: Pre-defined by
text type file;

(iv) Pagination: Pre-defined by text
type file;

(v) End-of-file: Pre-defined by text
type file;

(vi) Media: (A) Diskette - 3.50 inch, 400
Kb storage;

(B) Diskette - 3.50 inch, 800 Kb
storage;

(C) Diskette - 3.50 inch, 1.4 Mb
storage;

(vii) Print Command: Use PRINT
command from any Macintosh
Application that processes text files,
such as MacWrite or TeachText;

(4) Magnetic tape: 0.5 inch, up to 2400
feet;

(i) Density: 1600 or 6250 bits per inch,
9 track;

(ii) Format: raw, unblocked;

(iii) Line Terminator: ASCII Carriage
Return plus optional ASCII Line Feed;

(iv) Pagination: ASCII Form Feed or
Series of Line Terminators;

(v) Print Command (Unix shell version
given here as sample response - mt/dev/rmt0;
lpr/dev/rmt0):

(g) Computer readable forms that are
submitted to the Office will not be
returned to the applicant.

(h) All computer readable forms shall
have a label permanently affixed thereto
on which has been hand printed or
typed, a description of the format of the
computer readable form as well as the
name of the applicant, the title of the
invention, the date on which the data
were recorded on the computer readable
form and the name and type of computer
and operating system which generated
the files on the computer readable form.
If all of this information cannot be
printed on a label affixed to the
computer readable form, by reason of
size or otherwise, the label shall include
the name of the applicant and the title of
the invention and a reference number,
and the additional information may be
provided on a container for the
computer readable form with the name
of the applicant, the title of the
invention, the reference number and the
additional information affixed to the
container. If the computer readable form
is submitted after the date of filing
under 35 U.S.C. 111, after the date of
entry in the national stage under 35
U.S.C. 371 or after the time of filing, in
the United States Receiving Office, an
international application under the PCT,
the labels mentioned herein must also
include the date of the application and
the application number, including series
code and serial number.

§ 1.825 Amendments to or replacement of
sequence listing and computer readable
copy thereof.

(a) Any amendment to the paper copy
of the "Sequence Listing" (§ 1.821(c))
must be made by the submission of
substitute sheets. Amendments must be
accompanied by a statement that
indicates support for the amendment in
the application, as filed, and a statement
that the substitute sheets include no
new matter. Such a statement must be a
verified statement if made by a person
not registered to practice before the
Office.

(b) Any amendment to the paper copy
of the "Sequence Listing," in accordance
with paragraph (a) of this section, must
be accompanied by a substitute copy of
the computer readable form (§ 1.821(e))
including all previously submitted data
with the amendment incorporated
therein, accompanied by a statement
that the copy in computer readable form
is the same as the substitute copy of the
"Sequence Listing." Such a statement
must be a verified statement if made by
a person not registered to practice
before the Office.

(c) Any appropriate amendments to
the "Sequence Listing" in a patent, e.g.,
by reason of reissue or certificate of
correction, must comply with the
requirements of paragraphs (a) and (b)
of this section.

(d) If, upon receipt, the computer
readable form is found to be damaged or
unreadable, applicant must provide,
within such time as set by the
Commissioner, a substitute copy of the
data in computer readable form
accompanied by a statement that the
substitute data is identical to that
originally filed. Such a statement must
be a verified statement if made by a
person not registered to practice before
the Office.

Appendix A — Sample Sequence Listing 
(1) GENERAL INFORMATION: 



(i) APPLICANT: Doe, Joan X, Doe, John Q
(ii) TITLE OF INVENTION: Isolation and
Characterization of a Gene Encoding a
Protease from Paramecium sp.
(iii) NUMBER OF SEQUENCES: 2
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Smith and Jones
(B) STREET: 123 Main Street
(C) CITY: Smalltown
(D) STATE: Anystate
(E) COUNTRY: USA
(F) ZIP: 12345
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Diskette, 3.50 inch, 800
Kb storage
(B) COMPUTER: Apple Macintosh
(C) OPERATING SYSTEM: Macintosh 5.0
(D) SOFTWARE: MacWrite
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: 09/999,999
(B) FILING DATE: 28-FEB-1989
(C) CLASSIFICATION: 999/99
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: PCT/US88/99999
(B) FILING DATE: 01-MAR-1988
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Smith, John A
(B) REGISTRATION NUMBER: 00001
(C) REFERENCE/DOCKET NUMBER: 01-0001
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (909) 999-0001
(B) TELEFAX: (909) 999-0002
(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 954 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic DNA
(iii) HYPOTHETICAL: yes
(iv) ANTI-SENSE: no
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Paramecium sp
(C) INDIVIDUAL/ISOLATE: XYZ2
(G) CELL TYPE: unicellular organism
(vii) IMMEDIATE SOURCE:
(A) LIBRARY: genomic
(B) CLONE: Para-XYZ2/36
(x) PUBLICATION INFORMATION:
(A) AUTHORS: Doe, Joan X, Doe, John Q
(B) TITLE: Isolation and Characterization
of a Gene Encoding a Protease from
Paramecium sp.
(C) JOURNAL: Fictional Genes
(D) VOLUME: I
(E) ISSUE: 1
(F) PAGES: 1-20
(G) DATE: 02-MAR-1988
(K) RELEVANT RESIDUES IN SEQ ID NO:
1: FROM 1 TO 954
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATCGGGATAG TACTGGTCAA GACCGGTGGA CACCGGTTAA CCCCGGTTAA GTACCGGTTA 60 

TAGGCCATTT CAGGCCAAAT GTGCCCAACT ACGCCAATTG TTTTGCCAAC GGGCAACGTT 120 

ACGTTCGTAC GCACGTATGT ACCTAGGTAC TTACGGACGT GACTACGGAC ACTTCCGTAC 180 

GTACGTACGT TTACGTACCC ATCCCAACGT AACCACAGTG TGGTCGCAGT GTCCCAGTGT 240

ACACAGACTG CCAGACATTC TTCACAGACA CCCC ATG ACA CGA CCT GAA CGT CTC 295
                                      Met Thr Pro Pro Glu Arg Leu
                                                      -30

TTC CTC CCA AGG GTG TGT GGC ACC ACC CTA CAC CTC CTC CTT CTG GGG 343
Phe Leu Pro Arg Val Cys Gly Thr Thr Leu His Leu Leu Leu Leu Gly
        -25                 -20                 -15

CTG CTG CTG GTT CTG CTG CCT GGG GCC CAT GTGAGGCAGC AGGAGAATGG 393
Leu Leu Leu Val Leu Leu Pro Gly Ala His
    -10                 -5

GGTGGCTCAG CCAAACCTTG AGCCCTAGAG CCCCCCTCAA CTCTGTTCTC CTAG GGG 450
                                                            Gly

CTC ATG CAT CTT GCC CAC AGC AAC CTC AAA CCT GCT GCT CAC CTC ATT 498
Leu Met His Leu Ala His Ser Asn Leu Lys Pro Ala Ala His Leu Ile
1               5                   10                  15

GTAAACATCC ACCTGACCTC CCAGACATGT CCCCACCAGC TCTCCTCCTA CCCCTGCCTC 558 

AGGAACCCAA GCATCCACCC CTCTCCCCCA ACTTCCCCCA CGCTAAAAAA AACAGAGGGA 618 

GCCCACTCCT ATGCCTCCCC CTGCCATCCC CCAGGAACTC AGTTGTTCAG TGCCCACTTC 678 

TAC CCC AGC AAG CAG AAC TCA CTG CTC TGG AGA GCA AAC ACG GAC CGT 726
Tyr Pro Ser Lys Gln Asn Ser Leu Leu Trp Arg Ala Asn Thr Asp Arg
            20                  25                  30

GCC TTC CTC CAG GAT GGT TTC TCC TTG AGC AAC AAT TCT CTC CTG GTC 774
Ala Phe Leu Gln Asp Gly Phe Ser Leu Ser Asn Asn Ser Leu Leu Val
        35                  40                  45

TAGAAAAAAT AATTGATTTG AAGACCTTCT CCCCATTCTG CCTCCATTCT GACCATTTCA 834 

GGGGTCGTCA CCACCTCTCC TTTGGCCATT CCAACAGCTC AAGTCTTCCC TGATCAAGTC 894 

ACCGGAGCTT TCAAAGAAGG AATTCTAGGC ATCCCAGGGG ACCCACACCT CCCTGAACCA 954 
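Each nucleotide line in the sample ends with the cumulative base count (60, 120, 180, and so on up to 954). A small Python sketch of re-deriving those totals is given below; it assumes upper-case A, C, G and T only, as in this old-rules sample, and the function name check_running_totals is hypothetical.

```python
from typing import List, Tuple

BASES = set("ACGT")


def check_running_totals(block: str) -> List[Tuple[int, int]]:
    """Return (printed_total, counted_total) for every line where they differ."""
    total = 0
    mismatches = []
    for line in block.splitlines():
        tokens = line.split()
        if len(tokens) < 2 or not tokens[-1].isdigit():
            continue  # amino acid lines, negative numbering, blank lines
        groups = tokens[:-1]
        if not all(set(group) <= BASES for group in groups):
            continue  # numbering lines such as "20 25 30"
        total += sum(len(group) for group in groups)
        printed = int(tokens[-1])
        if total != printed:
            mismatches.append((printed, total))
            total = printed  # resynchronise so one slip is reported once
    return mismatches


first_lines = (
    "ATCGGGATAG TACTGGTCAA GACCGGTGGA CACCGGTTAA CCCCGGTTAA GTACCGGTTA 60\n"
    "TAGGCCATTT CAGGCCAAAT GTGCCCAACT ACGCCAATTG TTTTGCCAAC GGGCAACGTT 120\n"
)
print(check_running_totals(first_lines))
```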

(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 82 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(ix) FEATURE:
(A) NAME/KEY: signal sequence
(B) LOCATION: -34 to -1
(C) IDENTIFICATION METHOD: similarity
to other signal sequences, hydrophobic
(D) OTHER INFORMATION: expresses
protease
(x) PUBLICATION INFORMATION:
(A) AUTHORS: Doe, Joan X, Doe, John Q
(B) TITLE: Isolation and Characterization
of a Gene Encoding a Protease from
Paramecium sp.
(C) JOURNAL: Fictional Genes
(D) VOLUME: I
(E) ISSUE: 1
(F) PAGES: 1-20
(G) DATE: 02-MAR-1988
(K) RELEVANT RESIDUES IN SEQ ID NO:
2: FROM -34 TO 48



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Thr Pro Pro Glu Arg Leu Phe Leu Pro Arg Val Cys Gly Thr Thr
                -30                 -25                 -20

Leu His Leu Leu Leu Leu Gly Leu Leu Leu Val Leu Leu Pro Gly Ala
            -15                 -10                 -5

His Gly Leu Met His Leu Ala His Ser Asn Leu Lys Pro Ala Ala His
        1               5                   10

Leu Ile Tyr Pro Ser Lys Gln Asn Ser Leu Leu Trp Arg Ala Asn Thr
15                  20                  25                  30

Asp Arg Ala Phe Leu Gln Asp Gly Phe Ser Leu Ser Asn Asn Ser Leu
                35                  40                  45

Leu Val
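The sample numbers the signal sequence with negative values (-34 to -1), the mature protein from 1, and prints a number under residue 1 and under every fifth residue, positioned with space characters. The Python sketch below shows one way such a layout could be generated; it is an illustration under those assumptions, not PatentIn's own output routine, and the names residue_number and format_protein are hypothetical.

```python
from typing import List


def residue_number(index: int, first_number: int) -> int:
    """Number of the residue at a 0-based index; there is no residue 0."""
    number = first_number + index
    if first_number < 0 and number >= 0:
        number += 1  # -1 is followed directly by 1
    return number


def format_protein(residues: List[str], first_number: int, per_line: int = 16) -> str:
    """Residue lines with numbers under residue 1 and each fifth residue."""
    blocks = []
    for start in range(0, len(residues), per_line):
        chunk = residues[start:start + per_line]
        cells = []
        for offset in range(len(chunk)):
            number = residue_number(start + offset, first_number)
            label = str(number) if number == 1 or number % 5 == 0 else ""
            cells.append(label.ljust(4))  # spaces only, never tabs
        blocks.append(" ".join(chunk) + "\n" + "".join(cells).rstrip())
    return "\n\n".join(blocks)


first_row = "Met Thr Pro Pro Glu Arg Leu Phe Leu Pro Arg Val Cys Gly Thr Thr".split()
print(format_protein(first_row, first_number=-34))
```

With 16 three-letter residues per line the widest line is 63 characters, which stays inside the 72-character limit.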

BILLING CODE 3510-16-C



